Issues in Indian languages computing in particular reference to search and retrieval in Telugu language
Identifieur interne : 000953 ( Main/Exploration ); précédent : 000952; suivant : 000954Issues in Indian languages computing in particular reference to search and retrieval in Telugu language
Auteurs : Devika P. Madalli [Inde] ; Dimple Patel [Inde]Source :
- Library Hi Tech [ 0737-8831 ] ; 2009-09-04.
Abstract
Purpose The purpose of this paper is to discuss the various issues involved in Indian languages computing, particularly Telugu, like creating, displaying, searching and retrieving digital content. The paper also aims to emphasize the issues involved in retrieval in Indian languages. The complexities presented by the grammar, syntax and morphology of Indian languages are discussed. Designmethodologyapproach The paper undertakes and presents descriptive study of the issues and challenges in Indian languages computing in general and Telugu language in particular. Findings The problem of multilingual information retrieval in Indian languages is multipronged. A major observation of this study is that, though digital content is available in Indian languages, it is mostly in nonstandard encoding format and fonts. There is an urgent need to work in the area of developing search algorithms for Indian languages, like soundex and metaphones to tolerate spelling variations and mistakes that a user might make in queries and suggest correct spellings. Practical implications With existing technologies libraries can now build online catalogues in the language of the documents or build digital repositories with content in various Indian languages. Though a few library automation software like NewGenLib and digital library software like DSpace, etc. are offering Unicode support for Indian languages, they do not allow for different types of search such as truncation search, word variants, etc. The present study is a step towards developing algorithms for indexing and searching in Indian languages. Originalityvalue The paper addresses various issues in Indian language computing with emphasis on search and retrieval.
Url:
DOI: 10.1108/07378830910988568
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000500
- to stream Istex, to step Curation: 000493
- to stream Istex, to step Checkpoint: 000475
- to stream Main, to step Merge: 000961
- to stream Main, to step Curation: 000953
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Issues in Indian languages computing in particular reference to search and retrieval in Telugu language</title>
<author><name sortKey="Madalli, Devika P" sort="Madalli, Devika P" uniqKey="Madalli D" first="Devika P." last="Madalli">Devika P. Madalli</name>
</author>
<author><name sortKey="Patel, Dimple" sort="Patel, Dimple" uniqKey="Patel D" first="Dimple" last="Patel">Dimple Patel</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:E12736709606DB346F098D7972E67B9F57E72FA7</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1108/07378830910988568</idno>
<idno type="url">https://api.istex.fr/document/E12736709606DB346F098D7972E67B9F57E72FA7/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000500</idno>
<idno type="wicri:Area/Istex/Curation">000493</idno>
<idno type="wicri:Area/Istex/Checkpoint">000475</idno>
<idno type="wicri:doubleKey">0737-8831:2009:Madalli D:issues:in:indian</idno>
<idno type="wicri:Area/Main/Merge">000961</idno>
<idno type="wicri:Area/Main/Curation">000953</idno>
<idno type="wicri:Area/Main/Exploration">000953</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Issues in Indian languages computing in particular reference to search and retrieval in Telugu language</title>
<author><name sortKey="Madalli, Devika P" sort="Madalli, Devika P" uniqKey="Madalli D" first="Devika P." last="Madalli">Devika P. Madalli</name>
<affiliation wicri:level="1"><country xml:lang="fr">Inde</country>
<wicri:regionArea>Documentation Research and Training Centre, Indian Statistical Institute, Bangalore</wicri:regionArea>
<wicri:noRegion>Bangalore</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Patel, Dimple" sort="Patel, Dimple" uniqKey="Patel D" first="Dimple" last="Patel">Dimple Patel</name>
<affiliation wicri:level="1"><country xml:lang="fr">Inde</country>
<wicri:regionArea>Department of Library & Information Science, Osmania University, Hyderabad</wicri:regionArea>
<wicri:noRegion>Hyderabad</wicri:noRegion>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Library Hi Tech</title>
<idno type="ISSN">0737-8831</idno>
<imprint><publisher>Emerald Group Publishing Limited</publisher>
<date type="published" when="2009-09-04">2009-09-04</date>
<biblScope unit="volume">27</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="450">450</biblScope>
<biblScope unit="page" to="459">459</biblScope>
</imprint>
<idno type="ISSN">0737-8831</idno>
</series>
<idno type="istex">E12736709606DB346F098D7972E67B9F57E72FA7</idno>
<idno type="DOI">10.1108/07378830910988568</idno>
<idno type="filenameID">2380270310</idno>
<idno type="original-pdf">2380270310.pdf</idno>
<idno type="href">07378830910988568.pdf</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0737-8831</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract">Purpose The purpose of this paper is to discuss the various issues involved in Indian languages computing, particularly Telugu, like creating, displaying, searching and retrieving digital content. The paper also aims to emphasize the issues involved in retrieval in Indian languages. The complexities presented by the grammar, syntax and morphology of Indian languages are discussed. Designmethodologyapproach The paper undertakes and presents descriptive study of the issues and challenges in Indian languages computing in general and Telugu language in particular. Findings The problem of multilingual information retrieval in Indian languages is multipronged. A major observation of this study is that, though digital content is available in Indian languages, it is mostly in nonstandard encoding format and fonts. There is an urgent need to work in the area of developing search algorithms for Indian languages, like soundex and metaphones to tolerate spelling variations and mistakes that a user might make in queries and suggest correct spellings. Practical implications With existing technologies libraries can now build online catalogues in the language of the documents or build digital repositories with content in various Indian languages. Though a few library automation software like NewGenLib and digital library software like DSpace, etc. are offering Unicode support for Indian languages, they do not allow for different types of search such as truncation search, word variants, etc. The present study is a step towards developing algorithms for indexing and searching in Indian languages. Originalityvalue The paper addresses various issues in Indian language computing with emphasis on search and retrieval.</div>
</front>
</TEI>
<affiliations><list><country><li>Inde</li>
</country>
</list>
<tree><country name="Inde"><noRegion><name sortKey="Madalli, Devika P" sort="Madalli, Devika P" uniqKey="Madalli D" first="Devika P." last="Madalli">Devika P. Madalli</name>
</noRegion>
<name sortKey="Patel, Dimple" sort="Patel, Dimple" uniqKey="Patel D" first="Dimple" last="Patel">Dimple Patel</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000953 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000953 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:E12736709606DB346F098D7972E67B9F57E72FA7 |texte= Issues in Indian languages computing in particular reference to search and retrieval in Telugu language }}
This area was generated with Dilib version V0.6.32. |